ABSTRACT

There are several techniques for fitting a straight line to a collection of data points. In the
expression Y = a + bX the parameters of interest are the intercept, a, and slope, b. Herein these
parameters are subscripted by y if they are derived by minimizing the variance in Y, by x if the
variance in X is minimized, and by xy if a reduced major axis analysis is used (see text). The
correlation coefficient is designated by r. This paper notes that the slopes and correlation
coefficient are related through $r^2 = b_y/b_x = (b_y/b_{xy})^2$. The corresponding standard
deviations and correlation coefficient are related by $r^2 = S_{b_y}/S_{b_x} = S_{b_{xy}}/S_{b_x}$.
RELATIONSHIPS AMONG THE SLOPES OF LINES DERIVED FROM
VARIOUS DATA ANALYSIS TECHNIQUES AND 
THE ASSOCIATED CORRELATION COEFFICIENT 

This note points out some simple and insightful features associated with fitting a straight line to
data, which may be useful to, but are not widely known by, many members of the scientific community.
Specifically, there exist relations connecting the correlation coefficient and the slopes of the lines
derived using alternative data analysis criteria. In the following discussion $X_i$ and $Y_i$ are to be
regarded as observations, neither of which can be preferentially regarded as an independent or
dependent variable. The analyses use the data along with error minimization criteria to derive the
intercept, a, and slope, b, in the assumed relationship between X and Y, namely Y = a + bX. If the
variance in Y is minimized, the derived constants are

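$$ b_y = \frac{N \sum X_i Y_i - \sum X_i \sum Y_i}{N \sum X_i^2 - \left(\sum X_i\right)^2} = \frac{S_{xy}}{S_x^2} \qquad (1a) $$

$$ a_y = \bar{Y} - b_y \bar{X} \qquad (1b) $$
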
where $S_{xy}$ is the covariance of X and Y, $S_x$ is the standard deviation of X, $\bar{Y}$ and
$\bar{X}$ are mean values, and the sums are taken over the N sets of observations. Alternatively, if
the variance in X is minimized,

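$$ b_x = \frac{N \sum Y_i^2 - \left(\sum Y_i\right)^2}{N \sum X_i Y_i - \sum X_i \sum Y_i} = \frac{S_y^2}{S_{xy}} \qquad (2a) $$

$$ a_x = \bar{Y} - b_x \bar{X} \qquad (2b) $$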

Taking the ratio of the slope $b_y$ to the slope $b_x$ results in an expression which can be
recognized as the square of the correlation coefficient, r, between X and Y, i.e.,

$$ \frac{b_y}{b_x} = \frac{S_{xy}^2}{S_x^2 S_y^2} = r^2 $$

In general $|b_x| > |b_y|$ since $b_y$ is derived from minimizing the variance in Y and $b_x$ from
minimizing the variance in X. The slopes are equal if the data are perfectly correlated, $r^2 = 1$;
otherwise, the degree to which the slopes differ is a measure of the departure from perfect correlation.
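
The relation between the two slopes and the correlation coefficient can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the function name `slopes_and_r` and the sample data are hypothetical and chosen only for illustration. Variances and the covariance are computed with a 1/N normalization, which cancels in every ratio used here.

```python
import numpy as np

def slopes_and_r(x, y):
    """Return (b_y, b_x, r) for the line Y = a + bX."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    dx = x - x.mean()
    dy = y - y.mean()
    s_xy = np.mean(dx * dy)          # covariance of X and Y
    s_xx = np.mean(dx * dx)          # variance of X
    s_yy = np.mean(dy * dy)          # variance of Y
    b_y = s_xy / s_xx                # slope when the variance in Y is minimized
    b_x = s_yy / s_xy                # slope when the variance in X is minimized
    r = s_xy / np.sqrt(s_xx * s_yy)  # correlation coefficient
    return b_y, b_x, r

# Hypothetical sample data, not taken from the note.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [1.1, 1.9, 3.2, 3.8, 5.1]
b_y, b_x, r = slopes_and_r(x, y)

# The two printed numbers should agree to rounding error.
print(b_y / b_x, r**2)
```

The sketch assumes the data are not uncorrelated, since a zero covariance leaves $b_x$ undefined.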

An alternative procedure for fitting a straight line to data, one which treats X and Y in a more
symmetric way than those so far considered, minimizes the sum of the triangular areas formed by
the derived straight line and lines parallel to the coordinate axes through the data points. This
reduced major axis formulation results in (Kermack and Haldane, 1950)

$$ b_{xy} = \pm \frac{S_y}{S_x}, \qquad a_{xy} = \bar{Y} - b_{xy} \bar{X} $$

where the sign of $b_{xy}$ is taken to be that of $S_{xy}$.
It now follows that

$$ r^2 = \frac{b_y}{b_x} = \left(\frac{b_y}{b_{xy}}\right)^2 = \left(\frac{b_{xy}}{b_x}\right)^2 $$

Thus if two of the parameters r, $b_x$, $b_y$, and $b_{xy}$ are known, the other two can be determined
with little further effort. The standard deviations of the slopes are also related to one another and
the correlation coefficient by


$$ S_{b_y} = S_{b_{xy}} = r^2 S_{b_x} $$
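
The statement that any two of r, $b_x$, $b_y$, and $b_{xy}$ determine the other two can be sketched in a few lines. The helper names below (`complete_slopes`, `sd_by_from_sd_bx`) are hypothetical, and the sketch assumes a nonzero correlation, that all slopes share the sign of r, and that the standard-deviation relation reads $S_{b_y} = S_{b_{xy}} = r^2 S_{b_x}$.

```python
import math

def complete_slopes(b_y=None, b_x=None, b_xy=None, r=None):
    """Given any two of (b_y, b_x, b_xy, r), return all four in a dict.

    Uses r^2 = b_y / b_x and b_xy^2 = b_x * b_y; assumes r != 0.
    """
    known = sum(v is not None for v in (b_y, b_x, b_xy, r))
    if known < 2:
        raise ValueError("need at least two of b_y, b_x, b_xy, r")
    if b_y is None:
        if b_x is not None and b_xy is not None:
            b_y = b_xy**2 / b_x
        elif b_x is not None:        # b_x and r are known
            b_y = r**2 * b_x
        else:                        # b_xy and r are known
            b_y = abs(r) * b_xy
    if b_x is None:
        if b_xy is not None:
            b_x = b_xy**2 / b_y
        else:                        # b_y and r are known
            b_x = b_y / r**2
    sign = math.copysign(1.0, b_y)   # slopes and r share one sign
    b_xy = sign * math.sqrt(b_y * b_x)
    r = sign * math.sqrt(b_y / b_x)
    return {"b_y": b_y, "b_x": b_x, "b_xy": b_xy, "r": r}

def sd_by_from_sd_bx(s_bx, r):
    """Standard deviation of b_y (and of b_xy) from that of b_x."""
    return r**2 * s_bx

# Hypothetical values: a known Y-on-X slope and correlation coefficient.
result = complete_slopes(b_y=0.5, r=0.5)
print(result)
```

If more than two parameters are supplied, the sketch trusts the slopes first and recomputes $b_{xy}$ and r from them, so inconsistent inputs are not detected.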
The preceding expressions are useful for deriving one set of straight-line parameters from another, a
procedure which is aided by the expression for the intercept, $a_j = \bar{Y} - b_j \bar{X}$, where
j = x, y, or xy. They are also useful for assessing the consequences of either using an inappropriate
minimization criterion or inverting a linear regression expression to get X = A + BY, where A = -a/b
and B = 1/b, without considering the implied changes in the selection of independent and dependent
variables.
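
The consequence of inverting a fitted line without refitting can be illustrated numerically. The sketch below is an illustration only, assuming NumPy and hypothetical sample data: it fits Y on X, inverts that line algebraically, and compares the resulting slope B with the slope obtained by actually regressing X on Y. Under the relations above, the two slopes should differ by the factor $r^2$.

```python
import numpy as np

def invert_line(a, b):
    """Algebraically invert Y = a + bX into X = A + BY."""
    return -a / b, 1.0 / b

# Hypothetical sample data with visible scatter.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([2.3, 2.9, 5.1, 4.8, 7.4, 6.9])

# np.polyfit returns coefficients from highest degree down: [slope, intercept].
b_y, a_y = np.polyfit(x, y, 1)   # Y = a_y + b_y X, variance in Y minimized
B_x, A_x = np.polyfit(y, x, 1)   # X = A_x + B_x Y, variance in X minimized

A_inv, B_inv = invert_line(a_y, b_y)   # naive inversion of the Y-on-X fit
r = np.corrcoef(x, y)[0, 1]

# The ratio of the refit slope to the naively inverted slope equals r^2.
print(B_x / B_inv, r**2)
```

The closer $r^2$ is to 1, the less the naive inversion differs from the refit line; for weakly correlated data the discrepancy can be large.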